avx vecwise ibutterfly #368
Previously, ilyalesokhin-starkware wrote…
OK, I guess one 64-bit value is two twiddles.
Reviewable status: 0 of 1 files reviewed, 3 unresolved discussions (waiting on @ilyalesokhin-starkware)
src/core/backend/avx512/fft.rs
line 197 at r1 (raw file):
// fft on b
// a:bcid
// fft on a
Done.
src/core/backend/avx512/fft.rs
line 213 at r1 (raw file):
// 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7
let t = _mm512_broadcast_i64x4(std::mem::transmute(twiddle1_dbl));
// Apply i:abcd => d:iabc
Done.
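For readers following the quoted line, here is a minimal scalar sketch of what a 256-bit to 512-bit broadcast does to the lanes. It does not use the intrinsic itself; `broadcast_256_to_512` is a hypothetical helper, and treating the register as sixteen 32-bit lanes is an assumption made to match the `0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7` comment.

```rust
/// Scalar model (hypothetical helper, not from the PR) of broadcasting a
/// 256-bit value into both halves of a 512-bit register, viewed as
/// 32-bit lanes: 8 input lanes become 16 output lanes.
fn broadcast_256_to_512(half: [u32; 8]) -> [u32; 16] {
    let mut out = [0u32; 16];
    // The lower 256 bits receive the input unchanged.
    out[..8].copy_from_slice(&half);
    // The upper 256 bits receive a second copy of the same input.
    out[8..].copy_from_slice(&half);
    out
}
```

Calling `broadcast_256_to_512([0, 1, 2, 3, 4, 5, 6, 7])` yields the lanes `0..=7` followed by `0..=7` again, which is the pattern the code comment describes.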
src/core/backend/avx512/fft.rs
line 232 at r1 (raw file):
Previously, ilyalesokhin-starkware wrote…
OK, I guess one 64-bit value is two twiddles.
Done.
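The remark that one 64-bit value holds two twiddles can be illustrated with a small sketch. The helper names `pack_pair` and `unpack_pair` are hypothetical, and placing the first twiddle in the low 32 bits is an assumption based on little-endian lane order on x86; the PR may pack them differently.

```rust
/// Packs two 32-bit twiddles into one 64-bit value (hypothetical helper).
/// Assumption: the first twiddle occupies the low 32 bits.
fn pack_pair(lo: u32, hi: u32) -> u64 {
    (lo as u64) | ((hi as u64) << 32)
}

/// Splits a 64-bit value back into its two 32-bit twiddles.
fn unpack_pair(v: u64) -> (u32, u32) {
    // Truncation keeps the low half; the shift exposes the high half.
    (v as u32, (v >> 32) as u32)
}
```

Under this layout, four 64-bit elements carry eight 32-bit twiddles, which is why a broadcast of four 64-bit elements can replicate eight twiddles.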
wdyt about the following:
The twiddles for layer 0 are unique and arranged as follows: 0 1 2 3 4 5 6 7 8 9 a b c d e f.
The twiddles for layer 1 are replicated in the following pattern: 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7.
Code quote:
// The twiddles for layer 0 are packed like:
Reviewable status: 0 of 1 files reviewed, 4 unresolved discussions (waiting on @ilyalesokhin-starkware)
src/core/backend/avx512/fft.rs
line 179 at r2 (raw file):
Previously, ilyalesokhin-starkware wrote…
wdyt about the following:
The twiddles for layer 0 are unique and arranged as follows: 0 1 2 3 4 5 6 7 8 9 a b c d e f.
The twiddles for layer 1 are replicated in the following pattern: 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7.
...
Done.
Reviewed all commit messages.
Reviewable status: 0 of 1 files reviewed, 2 unresolved discussions (waiting on @spapinistarkware)
src/core/backend/avx512/fft.rs
line 179 at r2 (raw file):
Previously, spapinistarkware (Shahar Papini) wrote…
Done.
not pushed
src/core/backend/avx512/fft.rs
line 200 at r2 (raw file):
// The twiddles for layer 2 are packed like:
// 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3
Suggestion:
// The twiddles for layer 2 are replicated in the following pattern:
// 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3
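The three layouts discussed in this review (layer 0 unique, layer 1 repeated twice, layer 2 repeated four times) follow one rule that can be sketched in scalar code. `lane_pattern` is a hypothetical helper, not a function from the PR, and it only covers the layers 0 to 2 that the comments mention.

```rust
/// Returns which twiddle index each of the 16 lanes holds for a given
/// layer (hypothetical helper). Layer 0 has 16 unique twiddles, layer 1
/// has 8 repeated twice, layer 2 has 4 repeated four times.
fn lane_pattern(layer: u32) -> [u8; 16] {
    // Only the layers described in the review comments are modelled.
    assert!(layer <= 2, "only layers 0..=2 are described in the review");
    // Number of distinct twiddles halves with each layer.
    let unique = 16usize >> layer;
    let mut out = [0u8; 16];
    for (i, lane) in out.iter_mut().enumerate() {
        // Lane i reuses twiddle i modulo the number of unique twiddles.
        *lane = (i % unique) as u8;
    }
    out
}
```

For example, `lane_pattern(2)` gives `0 1 2 3` four times in a row, matching the suggested comment wording.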
Reviewable status: 0 of 1 files reviewed, 2 unresolved discussions (waiting on @ilyalesokhin-starkware)
src/core/backend/avx512/fft.rs
line 179 at r2 (raw file):
Previously, ilyalesokhin-starkware wrote…
not pushed
Done.
src/core/backend/avx512/fft.rs
line 200 at r2 (raw file):
// The twiddles for layer 2 are packed like:
// 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3
Done.
Reviewed 1 of 1 files at r3, all commit messages.
Reviewable status: complete! all files reviewed, all discussions resolved (waiting on @spapinistarkware)